Multi-interface symmetric multiprocessor

ABSTRACT

A symmetric multiprocessor system includes a first processor and a second processor for executing a multi-threaded process on packets, a first inbound interface and a first outbound interface associated with the first processor, a first task queue accessible for reading by the first processor, a second inbound interface and a second outbound interface associated with the second processor, and a second task queue accessible for reading by at least the first processor. The first inbound interface receives incoming packets and has a first input buffer maintaining a first input queue of the packets for processing by the first processor. The first outbound interface receives packets from the first processor and transmits outgoing packets. The first task queue receives packets output from at least the second processor and maintains another input queue of the packets for processing by the first processor and which are outgoing from the first outbound interface. The second inbound interface receives incoming packets and has a second input buffer maintaining a second input queue of the packets. The second outbound interface receives packets from the second processor and transmits outgoing packets. The second task queue receives packets output from at least the first processor and maintains another input queue of the packets for processing by the second processor and which are outgoing from the second outbound interface. The first processor executes a process thread on packets by requesting the packets from the first input queue and the first task queue in a predetermined manner. The second processor executes a process thread on packets by requesting the packets from the second input queue and the second task queue in a predetermined manner.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to the field of computer systems for data communications networks. More particularly, the present invention relates to a symmetric multiprocessor system having a plurality of interface devices, and a method for operating the system.

2. The Background

Routers are an essential element of internetworks, an example of which is the Internet. Routers allow (or deny, if necessary) communications between LANs (local area networks) and WANs (wide area networks). Routers transfer packets of data from one network to another and one major task of routers is the switching and forwarding of data packets. A router receives many packets through its multiple interfaces, and a network input/output (NetIO) process (program) and/or port ASIC (application specific integrated circuit) handles incoming packets received on inbound interfaces and outgoing packets sent to outbound interfaces. A router also performs other processes such as configuration, management, maintenance and updating of the system.

Recent rapid expansion of data communications networks has brought pressure for higher performance in network devices such as routers and switches in order to handle the increasing network traffic load. One solution for realizing improved performance of a computer system or a network device, such as a router, is to provide the system with a plurality of microprocessors (CPUs) that execute individual processes simultaneously. Such a microprocessor system, in which a number of microprocessors share memory, I/O (input/output) devices, and other resources, is called a symmetric multiprocessing (SMP) system. A SMP system typically uses a single operating system (OS) to control the microprocessors and a high-speed bus to provide communication among the components of the SMP system. Any idle processor in a SMP system can be assigned any task or process and additional microprocessors can be added to improve the performance of and deal with increased loads on the SMP system. The task of parceling out processes to each component microprocessor of the SMP system may be handled by the OS of the system or by an application program running outside the OS. Although symmetric multiprocessing can produce significant performance gains under ideal conditions, the resources of the system must be allocated to the concurrently running processes in a reasonable manner. In addition, overhead increases as more microprocessors are added to the system.

Multithreading of a process (or task) also generally provides improved performance of a computer system. Multithreading allows different parts (threads) of a single program or process to be executed simultaneously. A thread is a portion of a program or process that can operate independently of other portions of the program or process. In a multithreaded application, all the threads can run at the same time without interfering with one another, if so programmed. In a multiprocessor system, the OS, which is capable of executing such a multithreaded program, allots the threads of the program to the microprocessors. However, it is often difficult to divide a program in such a way that separate microprocessors can execute different portions of it without interfering with one another.

If a router utilizes a SMP system (without multithreading), the processes performed by the router can be executed concurrently by several microprocessors. For example, a NetIO process may run on one microprocessor, while other processes may run on other microprocessors. However, if the NetIO process restricts itself to single-threaded operation, its input/output performance is limited by the microprocessor executing NetIO process. The process drops packets when it is overloaded despite the fact that there may be a number of idle microprocessors in the SMP system which could, but are not adapted to handle the load. Alternatively, the NetIO process can be multithreaded so that each thread is processed by an available microprocessor. For instance, a first packet received on an interface is handled by a first microprocessor, the following packet is handled by a second microprocessor while the first microprocessor is busy processing the first packet, and so forth.

In this scenario of a multithreaded NetIO process, each interface of the router can be used by any of the available microprocessors at any time. A NetIO forwarding process typically receives a packet on an inbound interface, switches the packet based on its destination address, and sends the packet to an outbound interface. Thus, one switching and forwarding process for a packet usually involves an inbound and an outbound interface, the result being that each of the microprocessors executing the NetIO process may have to access any interface of the router. Therefore, it is necessary to adopt some measure such as a mutex or scheduler to prevent more than one microprocessor from accessing the same interface at the same time. That is, the “right to use” each interface device must be dynamically allocated among the microprocessors and must not overlap. For example, using a scheduler, a process thread on each microprocessor is “awakened” when the interface becomes available to that microprocessor. However, this approach has the undesirable effect of dramatically slowing the router's switching performance.

Moreover, the multithreaded NetIO process may cause another problem known as “packet reordering” when successive packets arriving at an interface constitute a unit of information such as a single data file or an audio or video stream. When the first arriving packet is handled by a first microprocessor and the following packet by a second microprocessor, the second microprocessor may reorder the packets either because it thinks that the second packet should have arrived before the first packet or simply because it completes its task before the first microprocessor. When the reordered packet arrives on the interface, a third microprocessor, rather than the second microprocessor, may handle that third packet, resulting in another potential packet reordering. If the NetIO process is to avoid having more than one microprocessor handle the different interfaces, it is required to perform context switching (forcing one microprocessor to handle a particular packet stream so as to maintain packet ordering within the stream). Such context switching also significantly increases overhead and thus lowers the observed performance of the router or switch.

BRIEF DESCRIPTION OF THE INVENTION

A symmetric multiprocessor system includes a first processor and a second processor for executing a multi-threaded process on packets, a first inbound interface and a first outbound interface associated with the first processor, a first task queue accessible for reading by the first processor, a second inbound interface and a second outbound interface associated with the second processor, and a second task queue accessible for reading by at least the first processor. The first inbound interface receives incoming packets and has a first input buffer maintaining a first input queue of the packets for processing by the first processor. The first outbound interface receives packets from the first processor and transmits outgoing packets. The first task queue receives packets output from at least the second processor and maintains another input queue of the packets for processing by the first processor and which are outgoing from the first outbound interface. The second inbound interface receives incoming packets and has a second input buffer maintaining a second input queue of the packets. The second outbound interface receives packets from the second processor and transmits outgoing packets. The second task queue receives packets output from at least the first processor and maintains another input queue of the packets for processing by the second processor and which are outgoing from the second outbound interface. The first processor executes a process thread on packets by requesting the packets from the first input queue and the first task queue in a predetermined manner. The second processor executes a process thread on packets by requesting the packets from the second input queue and the second task queue in a predetermined manner.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a diagram schematically illustrating an example of the structure of a symmetric multiprocessor system in accordance with one presently preferred embodiment of the present invention.

FIG. 2 is a diagram illustrating an example of an internetwork.

FIG. 3 is a diagram showing an example of a routing table for a router.

FIG. 4 is a diagram illustrating an example of encapsulation and decapsulation processes performed on a packet sent through data-links of different types.

FIG. 5 is a diagram illustrating the OSI (Open Systems Interconnect) reference model.

FIG. 6 is a diagram schematically illustrating a portion of a symmetric multiprocessor system in accordance with an embodiment of the present invention.

FIG. 7 is a functional block diagram illustrating operations of a symmetric multiprocessor system in accordance with an embodiment of the present invention.

FIG. 8 is a process flow diagram illustrating a typical process thread executed on an incoming packet.

FIG. 9 is a process flow diagram illustrating a typical process thread executed on an outgoing packet.

FIG. 10 is a functional block diagram illustrating operations of a symmetric multiprocessor system in accordance with an embodiment of the present invention.

FIG. 11 is a functional block diagram illustrating operations of a symmetric multiprocessor system in accordance with another embodiment of the present invention.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

Those of ordinary skill in the art will realize that the following description of the present invention is illustrative only and not in any way limiting. Other embodiments of the invention will readily suggest themselves to such skilled persons having the benefit of this disclosure.

FIG. 1 schematically illustrates an example of the structure of a symmetric multiprocessor system 100 according to an embodiment of the present invention. The symmetric multiprocessor system 100 includes a plurality of microprocessors (CPUs) 10 (10-1,10-2, . . . , 10-n, where n is an integer greater than 1) (collectively referred to as 10), and a plurality of interface devices 12 (12-1, 12-2, . . . , 12-m, where m is an integer greater than 1) (collectively referred to as 12). The system 100 also includes a main memory (RAM) 16, a read only memory (ROM) 18, and a controller 20, which are connected to the microprocessors 10 via a bus 14. The controller 20 may be implemented in a controller chip or another microprocessor.

In this embodiment, the multiprocessor system 100 is utilized in a network device such as a router, though the present invention is applicable to any computer system having multiple microprocessors and multiple interface devices. In order to support various types and methods of communication and different protocols (network protocols as well as switching protocols), a router typically includes multiple interface devices. For example, as shown in FIG. 1, the SMP system 100 (as a router), may include one or more interface devices 12 such as Ethemet (port EO), Fast Ethemet (port E1), Token Ring (port To0), Serial Connection (port S0), ATM (port S1), Frame Relay (port S2), and similar data communications interfaces. Not all of these types need to be present and there may be multiple instances of one or more types.

The SMP system 100 may employ an operating system such as the Cisco IOS™ product, available from Cisco Systems, Inc. of San Jose, Calif., or another convenient operating system to handle a multiprocessor system with multiple interface devices. Packets are received on and sent from interface devices 12. Each of the interface devices 12 preferably includes an inbound interface for receiving packets and an outbound interface for sending data packets, although there is no requirement that each interface support two-way communication. Interface devices 12 are connected to microprocessors 10 and controller 20 via a bus 14.

FIG. 2 illustrates an example of an internetwork 200. The internetwork 200 includes a variety of LANs (such as Ethemet network 1.0 and Token Ring network 2.0) interconnected through private and public (Internet) WAN links (such as network 3.0). These links are implemented with network devices such as routers and switches. For example, packet traffic sent from a source host HI at address 1.3 on Ethemet network 1.0 to a destination host H2 at address 2.1 on Token Ring network 2.0 is routed by a router R1 (LAN-to-LAN routing). The router R1 is responsible for finding the path for the LAN hosts. A packet sent from a workstation H3 at address 2.2 on the Token Ring LAN 2.0 traverses three data links (Token Ring, Frame Relay, and Ethemet) to reach a file server H4 at address 4.1 on Ethemet network 4.0. This is an example of a LAN-to-WAN-to-LAN routing. The router R1, frame relay switches on the network 3.0, and router R2 participate in this routing.

As shown in FIG. 2, the router R1 is connected to four networks: Ethemet network 1.0, Token Ring network 2.0, network 3.0 (Frame Relay WAN), and network 5.0 (serial connection), and the router R1 is responsible for forwarding packets from one of these networks to another. In order to determine the best path to use to forward a packet, a router has a routing table. FIG. 3 shows an example of a routing table for the router R1. The routing table contains an association between destination network addresses and the corresponding output interfaces or ports. When the router R1 checks its routing table entries for the packet from the source host H1 (address 1.3) to destination host H2 (address 2.1), it discovers that the path to the destination address 2.1 uses an outgoing port To0, the interface to the Token Ring LAN 2.0.

As an internetwork grows, the path taken by a packet includes several relay points and a variety of data-link types, such as the path of the packet from the source host H3 to the destination sever H4. Encapsulation and decapsulation enables a packet to be sent over these different data links. FIG. 4 illustrates an example of such encapsulation and decapsulation performed on a packet sent from the workstation H1 to the file server H4. This illustration is in accordance with the OSI (Open Systems Interconnect) reference model as shown in FIG. 5, which is well known to those of ordinary skill in the art.

As shown in FIG. 4, when the workstation H1 at address 2.2 sends a packet to the file server H4 at address 4.1, the packet is encapsulated in a Token Ring frame addressed to the router R1 at the data-link layer (data-link address in a frame header) and to the file server H4 at the network layer (network address 4.1 in a network header). When the router R1 receives the frame, it removes the packet from the Token Ring frame (that is, strips the frame header of the packet), finds a path for it, encapsulates it in a Frame Relay frame, and forwards the frame to the router R2. The router R2 removes the packet from the Frame Relay frame, and then forwards the packet to the file sever H4 encapsulating it in a newly created Ethemet frame. When the file server H4 receives the Ethemet frame, it extracts the packet and passes it to the appropriate upper-layer process. The routers R1 and R2 keep the source address and the destination address in the network layer unchanged, while appending a respective datalink layer address at the interface. The data-link layer address is an address appropriate for the next hop along the path. This encapsulation enables the LAN-to-WAN packet flow.

Returning to FIG. 1, the operating system of the SMP 100 is capable of executing a multithreaded program such as a network input/output (NetIO) process handling packets incoming and outgoing through various interfaces of SMP 100. The microprocessors 10 execute such a multithreaded process. The controller 20 controls the data flow between the microprocessors 10 and interface devices 12 and other resources such as RAM 16 and ROM 18.

In the SMP system 100, each interface device 12 is controlled by one designated microprocessor (or group of microprocessors acting as a unit) 10. Two or more interface devices 12 can be associated with one such designated microprocessor. One specific interface device 12 is not controlled by more than one such designated microprocessor.

FIG. 6 illustrates a SMP system 300 according to a specific embodiment of the invention. Each interface device 34, 36 usually includes both an inbound interface 34 a, 36 a and an outbound interface 34 b, 36 b. SMP system 300 additionally includes at least a first microprocessor 30 and a second microprocessor 32. First interface device 34 is in communication with first microprocessor 30 and second interface device 36 is in communication with second microprocessor 32. Each microprocessor runs, for example, a packet forwarding thread and is responsible for switching any traffic on its corresponding interface device. In such a case, each microprocessor may be directly connected to the corresponding interface device, if desired, rather than being coupled to it via a controller.

The first and second microprocessors 30 and 32 are connected via a bus 114. The system 300 may also include a controller, a ROM, and other resources (not shown) which may also be connected to bus 114. Shared RAM 33 is also provided and is connected to bus 114 so that it may be accessed by both first microprocessor 30 and second microprocessor 32. First and second task queues 35 a, 35 b are maintained in shared RAM 33 for the purpose of providing communication between block 31 a associated with first microprocessor 30 and block 31 b associated with second microprocessor 32.

Any microprocessor may post (atomically or even non-atomically, but atomic posts are faster) to any other microprocessor's task queue, identifying a list of tasks that must be performed by the microprocessor associated with the task queue. (Tasks usually consist of switching a packet the rest of the way, but interface control commands could be posted there as well). Since each microprocessor governs a set of the available interfaces, all data/instructions for a particular interface are automatically routed to the correct microprocessor and posted to its task queue. Each microprocessor also listens for incoming packets on the interfaces that it owns. For example, assume that first microprocessor 30 “owns” first interface 34 and second microprocessor 32 owns second interface 36 and a packet arrives at first interface 34. As part of its normal operation, each microprocessor scans the interfaces it owns as well as its own task queue. First microprocessor 30 finds that a packet has arrived on first interface 34, so it reads the packet and performs decapsulation. First microprocessor 30 eventually determines that the packet needs to be routed out of second interface 36. First microprocessor 30 identifies the “owner” of second interface 36 as second microprocessor 32 (from a configuration table look-up, for example) and then constructs a task containing a request for further data processing; this task contains the pointer to the packet and the requested output interface (second interface 36). First microprocessor 30 then posts this task request to the task queue for second interface 36, which is second task queue 35 b, the task queue for second microprocessor 32. First microprocessor 30 then continues with its normal processing; it has finished the work it needs to do for this packet.

Second microprocessor 32, as part of its normal operations, scans its task queue, second task queue 35 b, and discovers that there is work to be done, so it picks up a task from the queue and begins processing it. In this case, it is the packet that first microprocessor 30 posted there; second microprocessor 32 then does the encapsulation work for the interface designated in the task request, and then sends the modified packet out the requested interface (second interface 36). Second microprocessor 32 then continues with its work, scanning the interfaces that it owns as well as its task queue (second task queue 35 b).

FIG. 7 is a functional block diagram illustrating operation of a SMP system 400 in accordance with another specific embodiment of the present invention. The system 400 includes at least a first processor 40 and a second processor 50, a first interface 42 comprising a first inbound interface 42 a and a first outbound interface 42 b, both associated with first processor 40, a first task queue 46 accessible for reading by first processor 40, a second interface 52 comprising a second inbound interface 52 a and a second outbound interface 52 b, both associated with second processor 50, and a second task queue 56 accessible for reading by second processor 50.

The first and second processors 40 and 50 execute a multi-threaded process on packets, such as a network input output (NetIO) process. First processor 40 executes a first process thread of the NetIO process, and second processor 50 executes a second process thread of the NetIO process. First inbound interface 42 a receives incoming packets and has an associated first input buffer 44 into which are placed received packets (or pointers thereto) to be processed by first processor 40. First outbound interface 42 b receives packets output from first processor 40 and transmits the packets to their respective destinations. First task queue 46 receives packets output from at least second processor 60. The packets output to first task queue 46 are packets which are to be sent out through first outbound interface 42 b. First task queue 46 maintains a queue of the packets to be processed by the first processor 40.

First processor 40 accesses the first input buffer 44 and first task queue 46 in a predetermined manner, and requests packets to process. After executing a process thread, first processor 40 outputs a packet to the first outbound interface 42 b via first output buffer 45 when the packet is to be transmitted from the first outbound interface 42 b. First processor 40 places a processed packet in second task queue 56 when the packet is to be transmitted from second outbound interface 52 b. Accordingly, first processor 40 does not directly “touch” second outbound interface 52 b, but enqueues the packet in second task queue 56 so that it is handled by the second processor 50 which is responsible for second interface 52.

Similarly, the second inbound interface 52 a receives incoming packets. The second inbound interface 52 a has an associated second input buffer 54 into which are placed received packets (or pointers thereto) to be processed by second processor 50. Second outbound interface 52 b receives packets output from the second processor 50 via second output buffer 47 and transmits the packets to their respective destinations. Second task queue 56 receives packets output from at least first processor 40. Such packets are output to second task queue 56 when they are to be sent out through second outbound interface 52 b which is controlled by second processor 50. Second task queue 56 maintains some of the packets to be processed by second processor 50. Second processor 50 sends a packet to second outbound interface 52 b via second output buffer 47 when the packet is to be transmitted from second outbound interface 52 b, and sends a packet to first task queue 46 when the packet is to be transmitted from first outbound interface 42 b. In this manner, second processor 50 does not directly use first outbound interface 42 b, but enqueues the packet in first task queue 46 so that it can be handled by first processor 40.

The SMP system 400 may also include a third processor 70 for executing another process or processes. The system 400 may also include one or more additional processors. Such additional processors or processes may include an upper layer process or application using the data contained in the packet. For instance, if a packet received on the first inbound interface 42 a has as its destination system 400 itself, the packet should be handed to and processed by the appropriate process or application at system 400 rather than forwarded by the NetIO process. In such a case, first processor 40 transfers the packet to a third processor 70, which executes the application or other process or processes. Preferably, processor 70 has a buffer (not shown) and the first processor 40 places the packet in a queue in that buffer. Such a packet received on the second inbound interface 52 a is handled by the second processor 50 in the same way.

Processes running on third processor 70 may want to send a packet from one of the outbound interfaces 42 b and 52 b. For example, system 400 may need to send its update information to other computer systems (such as neighboring routers) on an internetwork. When third processor 70 wants to send a packet through the first outbound interface 42 b, it places the packet into first task queue 46. This allows first processor 40 to handle the packet instead of letting third processor 70 use the first outbound interface 42 b which could cause a conflict. Similarly, a packet from third processor 70 that is to be sent out from the second outbound interface 52 b is placed into second task queue 56, so as to be processed by the second processor 50. Packets processed in this manner may include, for example, routing table updates, pings, trace route requests, and the like.

The SMP system 400 may also include a discard buffer 72 for packets to be discarded when the packet has no valid destination or the packet's attempted access is unauthorized or should be prohibited. The first and second processors 40 and 50, and third processor 70, if present, place such a packet in the discard buffer 72, or simply drop it.

FIGS. 8 and 9 show examples of a process thread executed on a requested packet by the first and second processors 40 and 50. Since the roles of the first and second processors 40 and 50 are symmetric in system 400, the operation is explained for the first processor 40 as an example.

The first processor 40 polls first input buffer 44 and first task queue 46 in a predetermined manner to determine if there is a packet waiting to be processed. For example, first processor 40 accesses first input buffer 44 and first task queue 46 alternately so as to scan the first input queue and the first buffer queue maintained therein. If there is any packet in the queues, first processor 40 obtains the packet and performs the process thread on it. First processor 40 may request packets from the first input buffer 44 a predetermined number of times and then from first task queue 46 a predetermined number of times. Alternatively, an interrupt system based upon interrupt generation upon packet receipt at either first task queue 46 or first input buffer 44 may be used. Each processor may operate similarly.

FIG. 8 illustrates a typical process thread executed on an incoming packet. First processor 40 makes a request for a packet from the first input queue as described above at block 110. The packet received on first inbound interface 42 a has been encapsulated in a frame specific to the data-link type of the interface device, such as Ethemet, Token Ring, or the like. The packet goes through a decapsulation process at block 42 to be stripped of its frame header, and the destination network address is examined at block 114.

If the packet is to be processed by another process (block 116), the packet is passed to the appropriate process and the NetIO process thread has completed its work on that packet. For example, when a packet is addressed to the system 400 and contains data for another service or application, such as updating information for a routing table, the packet is transferred to (for example) third processor 70. The packet may be enqueued in a buffer of third processor 70 executing the appropriate process at block 118.

When the packet is to be switched or forwarded, a routing table is checked to determine whether the packet is routable or bridgeable at block 120. If there is no routing table entry matching the destination network address (invalid destination), the packet is neither routable nor bridgeable and is therefore dropped from the process as an unwanted packet. The packet may be simply discarded at block 122 and the sender may optionally be notified of the unreachable destination.

If the matching entry is found in the routing table, the path for the packet is determined in accordance with the routing table at block 124. An entry of the routing table indicates, for example, a destination network, some routing metric or state, and an outbound interface (or port) to use for forwarding the packet. The outbound interface is selected accordingly at block 126. The packet is placed in the task queue for the appropriate processor handling the selected interface (in this case, second task queue 56) at block 128. The remaining work on the packet (encapsulation and forwarding for transmission, for example) is processed by second processor 50.

If the destination network of a packet is the same as the source network, that is, the source host and the destination host are on the same network, the packet is sent out from first outbound interface 42 b. In this case, the packet may be placed in first task queue 46 so that the packet is subject to an encapsulation process carried out by first processor 40 when it is requested from first buffer queue (as shown in FIG. 9). Alternatively, first processor 40 may complete the forwarding (encapsulation) process on that packet and output it to first outbound interface 42 b, if the process thread is so programmed.

FIG. 9 illustrates a typical process thread executed on an outgoing packet. First processor 40 makes a request for a packet from first input buffer 44 in a manner as described above at block 140. The destination of the packet is examined at block 142 and the packet is encapsulated into a frame having the appropriated data-link address at block 144. The frame (or the frame header) is specific to the data-link type of first outbound interface 42 b and suitable for communicating with the next directly connected network device on the link. Then, the packet is transmitted from the outbound interface at block 146.

A process thread is not limited to those examples shown in FIGS. 8 and 9. For example, a process thread on an outgoing packet may include another procedure and/or a link to another process such as a packet traffic management procedure for permitting or denying traffic. Such a process may be based on the source address, destination address, and/or protocol. Such traffic management may be performed, for example, by using an outbound access list defining the type of traffic that is permitted or restricted. Alternately, the traffic management may be executed by another processor by providing a link thereto (enqueuing) after examining the destination of the packet. In addition, this traffic management may be included in or linked from the process thread performed on an incoming packet by using an inbound access list.

Although each processor is responsible for switching any traffic arriving on and leaving from “its” assigned interface device, another task or process may temporarily need to have control (or “ownership”) over the interface. For example, a chain maintenance task may need to access each interface device in the system. Thus, the right to control the interface device may have to be arranged in a certain manner. The polling operation by each processor may need to provide a break for allowing such an outside task or process to do such maintenance. This may be implemented, for example, by calling for a maintenance task after the processor executes a certain number of process threads or after processing a predetermined number of packets.

FIG. 10 is a functional block diagram illustrating operations of a SMP system 500 according to another specific embodiment of the present invention. The system 500 includes a fourth processor 60, a third inbound interface 62 a and a third outbound interface 62 b, both associated with the fourth processor 60, and a third task queue 66 accessible for reading by the fourth processor 60, in addition to the components of the system 400 as described above. The processors operate in the same general manner as those in the system 400.

FIG. 11 is a functional block diagram illustrating operations of a SMP system 600 according to another specific embodiment of the present invention. The system 600 includes a first processor 80 and a second processor 90, a first inbound interface 81 a, a first outbound interface 81 b, a second inbound interface 82 a and a second outbound interface 82 b, all associated with the first processor 80, a first task queue 86 accessible for reading by the first processor 80, a third inbound interface 91 a, a third outbound interface 91 b, a fourth inbound interface 92 a and a fourth outbound interface 92 b, all associated with the second processor 90, and a second task queue 96 accessible for reading by the second processor 90. The other structure of the system 600 is the same as that of system 400.

In the system 600, each microprocessor is responsible for two interface devices, each including an inbound interface and an outbound interface. If a packet is to be sent from one of the two outbound interfaces, the packet is enqueued in the task queue of the corresponding microprocessor. For example, first microprocessor 80 places a packet in second task queue 96 when the packet is to be sent from one of the third outbound interface 91 b and the fourth outbound interface 92 b. Second microprocessor 90 further processes the packet to assign it to one of its associated outbound interfaces 91 b and 92 b in accordance with the destination of the packet. Other operations of the system 600 are the same as those of the system 400.

Note that while each example shown herein includes an inbound and outbound interface corresponding to each interface device, one-way interface devices are also possible, such as input-only or output-only devices. It is also possible that the entire system might have a single inbound interface or a single outbound interface. The specific embodiments shown and described are more general and have wider applicability. The microprocessor and the corresponding interface device may be provided on the same card such as an Ethemet line card. The number of processors is not limited to two, as explained above, and the SMP system may contain any number of microprocessor-interface device set(s) and additional microprocessor(s) as deemed appropriate for the application. Finally, “Atomic writes” allow multiple queues to operate efficiently, so it is a preferable embodiment, but atomic instructions are not required for the invention to work.

While embodiments and applications of this invention have been shown and described, it would be apparent to those of ordinary skill in the art having the benefit of this disclosure that many more modifications than mentioned above are possible without departing from the inventive concepts herein. The invention, therefore, is not to be restricted except in the spirit of the appended claims. 

What is claimed is:
 1. A symmetric multiprocessor system comprising: a first processor and a second processor for executing a multi-threaded process on packets; a first inbound interface device associated with said first processor, said first inbound interface device receiving incoming packets and having a first input buffer accessible by said first processor, said first input buffer maintaining a queue of the packets received on said first inbound interface device; a first outbound interface device associated with said first processor, said first outbound interface device receiving packets output from said first processor and transmitting said packets therefrom; a first task queue accessible for reading by said first processor, said first task queue receiving packets output from at least said second processor, the packets in said first task queue being transmitted from said first outbound interface device; a second inbound interface device associated with said second processor, said second inbound interface device receiving incoming packets and having a second input buffer accessible for reading by said second processor, said second input buffer maintaining a queue of the packets received on said second inbound interface device; a second outbound interface device associated with said second processor, said second outbound interface receiving packets output from said second processor and transmitting said packets therefrom; and a second task queue accessible for reading by said second processor, said second task queue receiving packets output from at least said first processor. The packets in said second task queue being transmitted from said second outbound interface device.
 2. A symmetric multiprocessor system comprising: a first processor and a second processor for executing a multi-threaded process on packets; a first inbound interface device associated with said first processor, said first inbound interface device receiving incoming packets and having a first input buffer accessible by said first processor, said first input buffer maintaining a queue of the packets received on said first inbound interface device; a first outbound interface device associated with said first processor, said first outbound interface device receiving packets output from said first processor and transmitting said packets therefrom; a first task queue accessible for reading by said first processor, said first task queue receiving packets output from at least said second processor, the packets in said first task queue being transmitted from said first outbound interface device; a second outbound interface device associated with said second processor, said second outbound interface receiving packets output from said second processor and transmitting said packets therefrom; and a second task queue accessible for reading by said second processor, said second task queue receiving packets output from at least said first processor. The packets in said second task queue being transmitted from said second outbound interface device.
 3. A symmetric multiprocessor system according to claim 1, wherein said first processor executes a first process thread on packets by requesting packets from said first input buffer and said first task queue in a predetermined manner.
 4. A symmetric multiprocessor system according to claim 1, wherein said second processor executes a second process thread on packets by requesting packets from said second input buffer and said second task queue in a predetermined manner.
 5. A symmetric multiprocessor system according to claim 1, wherein said first processor sends a packet to said second task queue when the packet is to be transmitted from said second outbound interface device.
 6. A symmetric multiprocessor system according to claim 1, wherein said second processor sends a packet to said first task queue when the packet is to be transmitted from said first outbound interface device.
 7. A symmetric multiprocessor system according to claim 1, further comprising a third processor executing a process.
 8. A symmetric multiprocessor system according to claim 7, wherein said first task queue also receives packets output from said third processor.
 9. A symmetric multiprocessor system according to claim 7, wherein said second task queue also receives packets output from said third processor.
 10. A symmetric multiprocessor system according to claim 7, further comprising a third task queue accessible for reading by said third processor, wherein said first processor sends a packet to said third task queue when the packet is to be processed by said process executed by said third processor.
 11. A symmetric multiprocessor system according to claim 1 wherein said first and second processors operate under instructions to perform at least one of (a) decapsulating a packet; (b) examining the destination of a packet; (c) determining a path of a packet; (d) selecting an outbound interface device for a packet; and (e) encapsulating a packet.
 12. A symmetric multiprocessor system according to claim 11 wherein said first and second processors send a packet to a buffer for packets to be discarded when the packet has no valid destination.
 13. A symmetric multiprocessor system according to claim 1, further comprising: a third inbound interface device associated with said first processor, said third inbound interface device receiving incoming packets and having a third input buffer accessible for reading by said first processor, said third input buffer maintaining a queue of the packets received on said third inbound interface device; and a third outbound interface device associated with said first processor, said third outbound interface device associated with said first processor and receiving packets output from said first processor and transmitting said packets therefrom.
 14. A symmetric multiprocessor system according to claim 13, wherein said second processor sends a packet to said first task queue when the packet is to be transmitted from one of said first and third outbound interface devices.
 15. A symmetric multiprocessor system according to claim 13, wherein said first processor requests packets from said first input buffer, said third input buffer, and said first task queue in a predetermined manner.
 16. A symmetric multiprocessor system according to claim 1, further comprising: a fourth inbound interface device associated with said second processor, said fourth inbound interface device receiving incoming packets and having a fourth input buffer accessible for reading by said second processor, said fourth input buffer maintaining a queue of the packets received on said fourth inbound interface device; and a fourth outbound interface device associated with said second processor, said fourth outbound interface device receiving packets output from said second processor and transmitting said packets therefrom.
 17. A symmetric multiprocessor system according to claim 16, wherein said first processor sends a packet to said second task queue when the packet is to be transmitted from one of said second and fourth outbound interface devices.
 18. A symmetric multiprocessor system according to claim 16, wherein said first processor requests packets from said second input queue, said fourth input queue, and said second task queue in a predetermined manner.
 19. A method for operating a symmetric multiprocessor system including a first processor and a second processor for executing a multi-threaded process on packets, a first inbound interface device, a first outbound interface device, a first task queue associated with said first processor, a second inbound interface device, a second outbound interface device, and a second task queue associated with said second processor, said method comprising: receiving incoming packets on said first inbound interface device; maintaining a queue of packets received on said first inbound interface device in a first input buffer; receiving packets in said first task queue, said packets received from at least said second processor, said packets for transmission from said first outbound interface device; maintaining a queue of packets received in said first task queue; requesting a packet from one of said first input buffer and said first task queue in a predetermined manner; executing a first process thread on said requested packet; sending said packet processed by said first process thread to said second task queue when said packet is to be transmitted from said second outbound interface device; and outputting said packet processed by said first process thread to said first outbound interface device when said packet is to be transmitted from said first outbound interface device.
 20. A method according to claim 19, further comprising: receiving incoming packets on said second inbound interface device; maintaining a queue of packets received on said second inbound interface device in a second input buffer; receiving packets in said second task queue, said packets received from at least said first processor, said packets for transmission from said second outbound interface device; maintaining a queue of packets received in said second task queue; requesting a packet from one of said second input buffer and second task queue in a predetermined manner; executing a second process thread on said requested packet; sending said packet processed by said second process thread to said first task queue when the packet is to be transmitted from said first outbound interface device; and outputting said packet processed by said second thread to said second outbound interface device when said packet is to be transmitted from said second outbound interface device.
 21. A method according to claim 19, further comprising executing a third process by a third processor, wherein said receiving packets in said first task queue includes receiving packets output from said third processor.
 22. A method according to claim 21, wherein said system further includes a third task queue accessible for reading by said third processor, and said method further comprises: sending a packet processed by said first process thread to said third task queue when the packet is to be processed by said third process.
 23. A method according to claim 20, wherein said receiving packets in said second task queue includes receiving packets output from said third processor, and said method further comprises: executing a third process on a third processor.
 24. A method according to claim 23, wherein said system further includes a third task queue accessible for reading by said third processor, and said method further comprises: sending a packet processed by said second process thread to said third task queue when the packet is to be processed by said third process.
 25. A method according to claim 19, wherein said executing a first process thread includes at least one of: decapsulating a packet; examining a destination of a packet; determining a path of a packet' selecting an outbound interface device for a packet and encapsulating a packet.
 26. A method according to claim 20, wherein said executing a second process thread includes at least one of: decapsulating a packet; examining a destination of a packet; determining a path of a packet,; selecting an outbound interface device for a packet; and encapsulating a packet.
 27. A method according to claim 20, wherein said system includes a third inbound interface device and a third outbound interface device, and said method further comprises: receiving incoming packets on said third inbound interface device; maintaining a queue of the packets received on said third inbound interface device in a third input buffer; requesting a packet from said third input buffer in a predetermined manner; executing said first process thread on said packet; sending said packet processed by said first process thread to said second task queue when said packet is to be transmitted from said second outbound interface device; outputting said packet processed by said first process thread to said third outbound interface device when said packet is to be sent from said third outbound interface device; and sending a packet processed by said second process thread to said first task queue when said packet is to be transmitted from said third outbound interface device.
 28. A method according to claim 27, wherein said system includes a fourth inbound interface device and a fourth outbound interface device, and said method further comprises: receiving incoming packets on said fourth inbound interface device; maintaining a queue of the packets received on said fourth inbound interface device in a fourth input buffer; requesting a packet from said fourth input buffer in a predetermined manner; executing said second process thread on said packet; sending said packet processed by said second process thread to said first task queue when said packet is to be transmitted from one of said first outbound interface and said third outbound interface devices; and outputting said packet processed by said second process thread to said fourth outbound interface device when said packet is to be sent from said fourth outbound interface device.
 29. A program storage device readable by a machine, tangibly embodying a program of instructions executable by the machine to perform a method for operating a symmetric multiprocessor system including a first processor and a second processor for executing a multi-threaded process on packets, a first inbound interface device, a first outbound interface device, a first task queue associated with said first processor, a second inbound interface device, a second outbound interface device, and a second task queue associated with said second processor, said method comprising: receiving incoming packets on said first inbound interface device; maintaining a queue of packets received on said first inbound interface device in a first input buffer; receiving packets in said first task queue, said packets received from at least said second processor, said packets for transmission from said first outbound interface device; maintaining a queue of packets received in said first task queue; requesting a packet from one of said first input buffer and said first task queue in a predetermined manner; executing a first process thread on said requested packet; sending said packet processed by said first process thread to said second task queue when said packet is to be transmitted from said second outbound interface device; and outputting said packet processed by said first process thread to said first outbound interface device when said packet is to be transmitted from said first outbound interface device.
 30. A symmetric multiprocessor system comprising: a first processor; a first inbound interface associated with said first processor, said first inbound interface receiving task for processing by said first processor; a second processor; a second inbound interface associated with said second processor, said second inbound interface receiving tasks for processing by said second processor; a task queue readable by said second processor and writeable by said first processor, said task queue receiving task for processing by said second processor; an outbound interface associated with said second processor, said outbound interface capable of receiving an output associated with tasks performed by said second processor; and a task switcher selecting a task for performing by said second processor from among said second inbound interface and said task queue.
 31. A symmetric multiprocessor system comprising: a first processor; a first inbound interface associated with said first processor, said first inbound interface receiving tasks for processing by said first processor; a first outbound interface associated with said first processor, said first outbound interface capable of receiving an output associated with tasks performed by said first processor; a second processor; a second inbound interface associated with said second processor, said second inbound interface receiving tasks for processing by said second processor; a second outbound interface associated with said second processor, said second outbound interface capable of receiving an output associated with tasks performed by said second processor; a first task queue readable by said second processor and writeable by said first processor, said first task queue receiving tasks for processing by said second processor; a second task queue readable by said first processor and writeable by said second processor, said second task queue receiving tasks for processing by said first processor; a first task switcher selecting a task for performing by said first processor from among said first inbound interface and said first task queue; and a second task switcher selecting a task for performing by said second processor from among said second inbound interface and said second task queue. 